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1. Introduction 

Complex structure multidimensional item response theory (MIRT) is built on 
the idea that a single item, however simple it might be, carries the possibility of 
an inner structure. That is, in usual terminology one speculates that it is possible 
to measure several cognitive areas with one item. The number of cognitive areas 
so measured may vary among items, even though usual models assume that it 
is fixed for a collection of items (a test) and let a factor analysis type procedure 
decide on the number and mixture of cognitive areas measurable by the items. 

The point of view taken in this note is that any unidimensional item re- 
sponse theory (IRT) model can be thought of as a specialization of a MIRT 
model. Hence, the major task is to identify how much of the well established 
tools and nomenclature of unidimensional IRT can be preserved in the multidi- 
mensional context and, from the other direction, how different multidimensional 
notions may specialize to the same unidimensional entity. When the latter hap- 
pens, that is when two different multidimensional objects yield the same unidi- 
mensional specialization, then both multidimensional notions could be consid- 
ered proper generalizations of the underlying unidimensional quantity. A careful 
study should then be devised to decide which generalization is more appropriate 
with respect to the application at hand. 

There is, on the other hand, the possibility of not finding proper multidimen- 
sional generalization for some unidimensional notions. This topic also deserves 
careful research and understanding. 

Here, we consider what is termed complex structure MIRT. Usually, IRT mod- 
els have two components: the item likelihood and the population distribution. 
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In simple structure MIRT one item represents only one dimension and without 
a multivariate population distribution the entire likelihood of the model would 
factor as a product of univariate pieces. In complex structure MIRT this fac- 
torization is impossible, by definition, irrespective of population model chosen. 
Our main theorem will hold irrespective of the complexity of the structure. 

The structure of the paper is as follows. A short overview of unidimensional 
IRT is followed by the absolute^ that is coordinate system free, definition of 
MIRT. The connection with the usual approach is also shown via a discussion 
of two widely accepted models. Then, the development of the main thesis follows. 
In this we prove that MIRT models are all alike and they all can be obtained as a 
trivial extension of an appropriate unidimensional item response theory model. 
Two sections on some thoughts about capturing cognitive dimensions and on 
understanding the role of the notion of dimension-wise independence close the 
presentation. 

2. Unidimensional Item Response Theory 

To make the generalization to the multidimensional framework easier, let us first 
summarize some features of unidimensional IRT. Measurement takes place dur- 
ing the formation of the response matrix X G Mmxi{^) with elements Xni £ N 
for student n = 1, . . . , A'' and item i = 1, ...,/. In a dichotomous setting (which 
is assumed throughout the paper to simplify the presentation) Xni = 1 if student 
n responded correctly to item z, otherwise it is zero. As a major simplification of 
the modeling of the cognitive process it is assumed that the response to an item 
is stochastically determined by the ability 9 and item parameters Pi := {ai,bi,Ci) 
via the item response function ([l|): 

p3pi(0,/3,) Prob(x„, = 1 | 0, - c, + ^ ^ ^ ■ (1) 

There are, of course, many different item response functions in use, the three pa- 
rameter logistic model is chosen here only as an illustration. The other substan- 
tial simplification used in building the model is the assumption of independence 
of conditional probabilities Pj^f' across an arbitrary subset S C {!,... , A^} x 
{1, . . . , /} of student-item pairs. 

The two most popular models built out of these blocks are the joint unidi- 
mensional IRT and the marginal unidimensional IRT. Joint IRT states that the 
total likelihood depends explicitly on the ability of the given students: 

Lj-"*(X;e,£?) = n^'''H^«, (l-P'P'(^n,/30)'~"- (2) 
with corresponding log-likelihood: 

£)°"^\X- e,B) = J2 \og{P^P\0n,(3^)) + {1 - X^) log(l - P^P\0n, f3^)) . (3) 
n,i 
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Here, <d — {9i, . . . ,0^) and B = ...,/?/) are the collections of all abilities 
and item parameters, respectively. 

In the marginal theory the likelihood depends only on the distributional prop- 
erties of student's population: 

L-"S(X; *) = n / n ^')""* (1 ~ ^''"(^' A))'"""*d/.„(0) (4) 

n i 

with log-likelihood 

Viog / r\p'^He,f3,r-'ii-p'p\0,p,)y-^-^d^n{0), (5) 

where /^^ is the density measure of student n over M and $ is the collection of 
distributional parameters for student's ability. In parametric setting usually /i„ 
is given as 

d^ini0) = M0)<i0 

with some density function 
The quantities 

L^^iX; 0,B) = l[ P^P\0, (3,r-^ (1 - P3P1(0, (6) 

and 

n 

are the student and item likelihoods, respectively. 

A maximization of the joint model can be achieved by iteratively maximizing 
all the student likelihoods with fixed item parameters to obtain the next approx- 
imation of the abilities and all the item likelihoods with fix abilities to obtain 
the next approximation of item parameters. Starting values can be constructed 
from careful item analysis. 

It is worthwhile to analyze the shape of the student likelihood function. It 
is a product of the conditional probabilities of the actual responses over all the 
items administered to the student. As a function of the probability of the 
correct response is increasing when the actual response is correct and decreas- 
ing for incorrect actual response. As a consequence, a student likelihood will 
be increasing if all the actual responses are correct and decreasing if all the ac- 
tual responses are incorrect. This in turn pushes the location of the maximum 
likelihood solution for the given student to plus or minus infinity. For the item 
likelihood a similar statement holds. When at least two responses are different in 
each row and in each column of the dichotomous response matrix the existence 
of the unique maximum place is guaranteed in every step of the iteration. This, 
however does not necessarily imply that the iterative method will be convergent 
(0] gives a necessary and sufficient condition for the convergence of the joint 
Rasch model). The student likelihood can be well approximated by a normal 
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distribution (especially when the number of items is large enough) and its cur- 
vature will be inversely proportional to the asymptotic standard error of the 
ability estimates. 

The marginal model does not suffer from the same restriction so severely, be- 
cause the population density function, when chosen according to usual practice, 
will be sufficient to ensure the existence of a finite maximum place at each step 
of the iteration. In this case, the standard errors associated with the given row 
or column will be higher when constant response pattern is present. 

For this discussion to even make sense, we had to use the trivially available 
ordering of real numbers (playing the role of ability space in the unidimensional 
case) to use such notion as "increasing ability" . This point will be central to the 
multidimensional extension, since there will be no natural choice of ordering of 
multidimensional abilities. 

3. Multidimensional Item Response Theory 

3. 1 . Introduction 

In what follows we explore the possibility of defining MIRT in geometric terms 
without direct reference to coordinates. As before, the full response likelihood for 
a student is formed by multiplying single item conditional probabilities together 
(invoking the assumption of local independence). The likelihood of an MIRT 
model is then constructed by incorporating some sort of population model with 
these response likelihoods similar to the joint and marginal univariate cases 
(Equations [2 HI). 

The classification of MIRT is achieved at the level of a single item conditional 
probability in the same way as we would characterize a univariate IRT model as 
Rasch, 2PL or 3PL model. This does not mean that we restrict our presentation 
to single item tests. Realistic tests are treated using the local independence 
assumption as discussed before (Equations [2l HI and [6]). 

With this now clarified, from what follows, unless noted otherwise, we shall 
drop any reference to any particular student and item. This will also help us 
avoiding overflow of indices in the multidimensional context. 

3.2. Basic Models 

Even though widely investigated, MIRT is not yet widespread as an operational 
model. Hence, identifying the major players among the c omp eting MIRT models 
is difficult. Here, only two models are discussed, one by [l^l and another one by 

First, for an item we associate a vector of discriminations a — (ai, . . . , ao) G 
MP and a vector of difficulties h = (6i, . . . , fo^) G MP . With these the functional 
representation of the dimension-wise independent MIRT response likelihood of 
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12l | has the form 

d— 1 

where 6* = {6i, . . . ,9d) € M^. If the conditional probabihty of passing the 
dth dimension of the item is given by -[^^^-^^"(D^-bj) ; then ^ can indeed be 
understood as the joint probabihty of passing all the independent dimensions of 
the item. Unless there are separate observed scores for each dimension, language 
like "correct response on dimension d" cannot be used. In lack of this we used 
the "passing a dimension" term, which may refer to an unobservable event. 




Fig 1. Dimension-wise independent and Scalar Product MIRT hypersurface 

(see also [l^l ) put forward a model in which the response likelihood takes 
the functional representation 

/-:M-^[0,1], g^.CT)= ^^^_<„ (9) 

where a is as before and 6 G M. (a; | = X^dLi XdUd is the usual scalar product 
oi x,y ^ M^. We use the term Scalar Product MIRT to refer to this model. 

As a last step before embarking on the dimension free definition of MIRT 
let us write the marginal likelihood of the Scalar Product model assuming mul- 
tivariate normal population distribution. Using the notation of the previous 
section, the conditional probability of the response x„i for item i and student n 
is given by 

P(x„. I 9, P. = {a. A) e X M) = ^^^_^,^J,,^^^^\eH,^y (10) 
Then, the likelihood of the Scalar Product MIRT model is given by 

L{X; ^, *) - n / n ^(^™ I ^' ^')'^(^; ^n)d''e, (11) 
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where Lp{9]Vm'^n) is the muhivariate normal density with (possibly) student 
dependent mean Vn and covariance S„. Moreover, B — {f3i, . . . , is the collec- 
tion of item parameters and <& — (vi, Ei, . . . , vn, S^r) is that of the population 
parameters. 

The multidimensional student likelihood is given by the product of individual 
conditional probabilities over items administered to a given student n: 

Y[Pixn^\9,/3,). (12) 

i 

Figure [4] depicts a possible student likelihood in the two dimensional case. 

The likelihood (jlip is a multidimensional generalization of the univariate 
marginal likelihood given by ([4]). 

3.3. Definition of MIRT 

Our goal in this section is to give a definition of MIRT with as few assumption as 
possible. Multidimensional item response theory postulates that with one single 
item multiple cognitive abilities could be detected. To accommodate this idea, 
one has to change the model for the ability space from the one dimensional vector 
space M to a finite dimensional vector space Vg. While any finite dimensional 
vector space V is linearly isomorphic to for D — dim{V) (see (|14p for 
an explicit way of constructing such an isomorphism) , this isomorphism is not 
canonical (there is not a unique isomorphism V M^). By this, and other 
reasons that will become clear as we proceed, we chose not to use as a 
mathematical model of ability space. 

The reader unfamiliar with these notions is referred to for an excellent 
introduction to linear algebra. Also, an intuitive understanding of the basic no- 
tions of smooth manifolds should help understanding of what follows, although 
not strictly necessary. Among the many fine references to the topic the interested 
reader may find [lH useful. 

The basic object in unidimensional IRT is the item response function (IRF) 
and its graph, the item response curve (IRC). Recall, that the graph of a function 
/ : A ^ B is a subset graph(/) = {(a;, f{x)) £ A x B \ x e A} of A x B. IRC 
is a one dimensional smooth submanifold of M x [0, 1]. While there is a scaling 
freedom even in the one dimensional case (e.g. the (in)famous 1.7 multiplier 
in the logistic models), the possibility of ambiguous interpretation is minimal 
and one may use the functional (IRF) and the geometrical (IRC) representation 
almost interchangeably. 

In the multidimensional case, however, the matter is not so straightforward. 
As we shall see, the functional and the geometric representations are different 
in a subtle way. One way to keep the presentation coordinate system free in 
multidimensional IRT is to postulate that the theory is given by an item response 
hypersurface (IRHS). As in the unidimensional case, the IRHS is used to express 
the probability of correct response given an ability in Vg. 
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Before defining tliis notion, let us fix some notations. For any w G Ve tlie ray 
of V is defined to be the line M-w in Ve determined by v: M-u = {\v G Ve | A G R}. 
Similarly, for v,w G Ve the u-dirccted line going through w is defined by 

w + ^ {w + \v ^Ve I A G R}. 
For the notion of IRHS we then have the following 

Definition 1 A dichotomous item response hypersurface (IRHS) is a D = 
diTa(Ve) dimensional smooth submanifold M of Ve x [0,1], so that for any two 
vectors v,w &Ve the intersection of (it; + R • u) x [0, 1] and M is a graph of a 
monotonic function w ■ v ^ [0,1]- 

We shall say that a MIRT model is given when an IRHS is given. 

Note, that while w + R • w is not canonically isomorphic to R, monotonicity of 
the map 

: w + R-w ^ [0,1], \^ f^,,^{w + \v) (13) 

can be unambiguously defined by requiring that either fv,w{w + ^v) < fv,w{w + 
fiv) or fv,w{w + \v) > fv,w(w + jJLv) for all A,/i G R whenever \ < ii. We shall 
use the notation /„ = fy^. Figure [2] shows the intersection of (w + R • u) x [0, 1] 
and M in two dimensions. 

To understand the definition better, let us first assume that we choose v to 
be arbitrary and w — Q 'm the definition above. Then, the line w + R • w = R • w 
can be understood as an ability direction. The monotonicity requirement of 
Definition [1] asks for the natural feature that as the ability given by v increases 
the probability of the correct response increases as well. For non-zero w the 
requirement is equivalent to the conditional probability of correct response being 
monotonic with respect to one ability when the rest of the abilities are fixed to 
a certain not necessarily zero value. To be precise, we should say that for w 7^ 
there exists a basis of Ve so that the monotonicity requirement reads as the 
interpretation above. Furthermore, for any basis of Ve Definition [1] will ensure 
the monotonicity of the conditional probability of correct response for any ability 
direction given any fixed values for the rest of the ability directions (as defined 
by the basis). 

Note also that the collection of maps f^^w for v,w G Ve defines the IRHS 
completely. For this reason, we shall use the notation f^ ^ or / if no confusion 
may arise, for the function describing the IRHS M C Ve x [0, 1]. 

One may be tempted to object to the use of notions like manifold and hyper- 
surfaces. It is very important to note, however, that the conditional probability 
of correct response has been given by a hypersurface in the usual MIRT lit- 
erature as well. One major difference in terminology is that it was still called 
surface in any dimension, which is a correct usage only in dimension two. In 
higher dimensions, the object at hand is a hypersurface, a special case of higher 
dimensional manifolds. 

A basis D = (wi, . . . , v/j) in Ve defines a unique isomorphism 

D D 

zo : -> K^, ^ Kv, ^ ^ A.e„ (A. G R), (14) 

1=1 i=l 
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Fig 2. Intersection o/ ui + R ■ X [0, 1] and IRHS. The intersection is the bold curve, which 
is required to be monotonic. 

where (e^)^]^ is the standard basis of R^: {ei)j = 5ij {Sij is the Kronecker 
delta). This isomorphism can be triviaUy extended to a diffcomorphism 

^„ : V^e X [0,1] ^K^ X [0,1], {v,t) ^ {i„{v),t) (15) 

and via this diffeomorphism we may transfer the IRHS from Ve x [0, 1] to x 
[0, 1]. Now, in x [0, 1] the image of the IRHS may be given by the graph 
of a smooth function / : [0, 1]. Note, however the important difference 

between using a functional representation like this latter one and using the 
hypersurface representation directly in Ve x [0, 1]. The functional representation 
depends on the basis we chose to establish the diffeomorphism ipa and different 
bases may result in different functional representations. 

Note: It is tempting to extend this definition to polytomous multidimensional 
items by defining the polytomous collection of item response hypersurfaces for a 
polytomous item by requiring that the above discussed intersection be a collec- 
tion of unidimensional polytomous item response curves as produced by some 
unidimensional polytomous IRT model (e.g. Muraki's' partial credit model Q). 
The investigation of this possibility is postponed for a forthcoming paper. 

3.4. Properties of IRHS 

In this section we prove the main theorem of the paper. For the sake of trans- 
parency, we start with the two dimensional case which is then followed by the 
more involved general theory. 
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3.4- 1- Two Dimensional Case 

Using the monotonicity of the model we can prove an interesting elementary 
property. 

Lemma 1 In any 2 dimensional MIRT model there exists a line in Vg through 
the origin so that fy is constant. 

Proof: Let us choose a vector v €Vg. Note, that if inf /^(Au) = sup/„(Aw) the 

lemma is proved, the sought after line is M • i". Therefore, we may assume that 
inf fvi^v) < sup/t,(Aw). For such a vector either lim fv{Xv) = sup (Aw) or 

lim fvi^v) = sup/i,(Au). Let P be the set of vectors satisfying the first and 
A^-oo xm 

N be the set of vectors satisfying the second condition. Both of these sets are 
non-empty and by continuity, both sets are open. Also, they are clearly disjoint. 
Therefore, there is a vector u Q Vg so that u ^ N IJ P. Along the line R • u the 
function / is constant. ■ 

Note that the proof only uses monotonicity with w — 0. Utilizing it for general 
w the same argument provides the following 

Lemma 2 In any 2 dimensional MIRT model, through any point w £ Vg there 
exists V Vg so that along the v-directed line going through w the function fy^^ 
is constant. 

We introduce the term w- constant line, or simply constant line, for the u-directed 
line going through w as in Lemma [2] 



f=const. 



f=const. 



Fig 3. Non-unique constant line results in constant MIRT model. 

Analyzing the properties of these constant lines further we see that they are 
actually parallel to one another. That is we have the following 

Lemma 3 Let w,w' G Vg be two points. Let v,v' G Vg be the corresponding 
directions of the two constant lines. Then v — fiv' for some /i G M. 
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Proof: First, we note that if there is a point w e Ve so that there exist two 
w-constant lines, then the model is trivial (/ is constant) and the statement is 
true. For, let w' and w" be the intersections of a general position line in Vg with 
the two w-constant lines, respectively. Because / is monotonic along this line, 
and f{w') = f{w") f is constant between w' and w" , that is f {tw' + {I ~ t)w") = 
f{w') for all t G [0, 1]. Using this argument for every line in general position 
proves that / is constant everywhere (see Figure [3]) . 

Now, we assume that the constant lines through w and w' are unique. If the 
two lines are not parallel then they will have an intersection and an argument 
similar to the previous one shows that / is constant. ■ 

The corollary of the previous observation is the 

Theorem 1 Any 2 dimensional MIRT model is a trivial extension of a unidi- 
mensional IRT model. 

Proof: We saw in Lemma[3]that a 2 dimensional IRHS is nothing but a collection 
of parallel lines. Let v ^ Vg he the direction of these lines. Choosing a transversal 
R • u (a line that intersects all of them) to this collection the IRHS can be given 
by the function : M ■ m — > [0, 1]. For, let us express an arbitrary w g as a 
unique linear combination w ~ iiu + Xv and write 

f{w) = f{fm + Xv) = f^{^iu). (16) 

This function can be thought of as a unidimensional IRT model. ■ 

3.4-2. D-Dimensional Case 

Technically, the D dimensional case is not that much more complicated than 
the 2 dimensional one. It is just much more difficult to visualize the correspond- 
ing geometric objects. As we pointed out earlier, the conditional probability 
"surface" is not 2 dimensional, so strictly speaking it is not a surface in higher 
dimensions. Our three dimensional training does not allow us to "see" objects 
in higher dimensions. The formalism we built in the previous section, however, 
will be applicable, with appropriate modifications, to this situation as well. 

The proof of Lemma[l]works for any dimensions. Applying the monotonicity 
argument for arbitrary (w, w) as above proves the corresponding 

Lemma 4 In any MIRT model there exists a hyperplane Hu, in Vg through 
w & Vg so that fn^ is constant. 

Proof: Here, fn^, is the restriction of / to the hyperplane 11^. As before, we 
prove w = explicitly; the general case follows the same argument. Let us, 
as before, define the open sets P and N and note that P — —N. Exclude the 
trivial case of P = 0. It is clear that PUN ^ Vg. Locally the boundary of P 
(the closure of P minus P) is a Z3 — 1 dimensional submanifold [D = dimVe). 
Therefore there exists a collection (ci, . . . ,cd-i) of points in Vg\{PU N) so that 
(ci — w, . . . , cd-1 — w) spans a hyperplane H^. Now, along the line segment 
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joining any point on R • (ci — w) with any point on M • [cj — w), for some i ^ j, 
the restriction of / should be constant (Figure ^ . 

Repeating this argument for each pair of hne segments shows that along 
the entire hyperplane / is constant. If there is another hyperplane with this 
property, then P = 0, which is excluded. I 

Now, a Z? — 1 dimensional hyperplane is to a dimensional space as a line 
is to the plane. Using this intuition it is not difficult to adapt the formal proof 
of Lemma |3] to prove 

Lemma 5 Let w, w' G Vg be two points. Let H^, H^' C Ve be the corresponding 
two constant hyperplanes. Then and H^i are parallel. 

Now, we are ready to rephrase our main theorem in arbitrary dimension. 

Theorem 2 Any MIRT model is a trivial extension of a unidimensional IRT 
model. 

For the sake of explicitness let us write /^^ for an arbitrary IRHS in terms 
of univariate IRT model. Let us fix a transversal u e Ve to the collection of 
constant hyperplanes. First, we observe that for any w G there is a unique 
decomposition w — fiu + Xv with v e ■ Then, 

f\^,u + Xv)^f::'{flu). (17) 

Note that if we choose the usual 2P1 or 3PL models the construction yields the 
Scalar Product model. It is also interesting to note that the MIRT generalization 
of the Rasch model is equivalent to the generalization of the 2PL model. This is 
because, while within the univariate Rasch model one may assume that the slope 
is fixed, when more dimensions are considered simultaneously the assumption of 
equal slopes is not valid. The relative positions of slopes to one another should 
be determined during the estimation procedure in lack of a priori information. 

This kind of models were called generalized compensatory models (GMIRT) 
in [^]. The link hmction of an IRHS as GMIRT is /*^. 

3.4.3. Absolute Functional Representation for the Scalar Product Model 

A notable feature of the Scalar Product model is that using the dual of a vector 
space it can be defined without referring to coordinates even in its functional 
form. First, we recall that the dual V* of a finite dimensional vector space 

V is the finite dimensional vector space of the same dimension of linear maps 

V ^R: 

V* :={p:V ^R\pis linear}. (18) 
The duality is the obvious map 



{\):V* xV ^R, {p,v) >^ {p \ v) := p{v). 



(19) 
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That is, for any p e V* and v ^ V the quantity ( p | u) is a real number. It 
is important to note that the duahty, unhke a scalar product, does not involve 
any choice. 

Now, if in MIRT we make the choice, that the ability is modeled by the vector 
space Vg as before and the item is modeled by the discrimination a S Vg in the 
dual space and a real number h then the IRHS of the model is given as the graph 
of the following function: 

fi, : Ve - [0, 1], fUe) := (20) 

In addition to its very satisfying and elegant nature this model has the computa- 
tional advantage of having the same functional representation in any coordinate 
system. As we shall see later the dimension-wise independent model does not 
share this nice invariance property. 

3.4-4- Interpretation of Main Theorem 

The statement of the main theorem excludes many existing MIRT models from 
the pool of monotonic MIRT models. The author's reading of the main theorem 
is that the only relevant MIRT model is the one defined in (|17p . This interpreta- 
tion is backed by the fact the widely used and tested estimation tools exist only 
for the Scalar Product model, the most relevant of the above extensions f[lo|). 
In the view of Theorem [5] there seems to be a good reason behind that. It seems 
that lack of monotonicity prevents one to maximize the likelihood function of 
MIRT models excluded by our approach. This certainly defines a valid future 
research direction. Also, the existence of an elegant coordinate free functional 
representation makes the Scalar Product model even more appealing. 

On the other hand, model building always has many steps that cannot be 
entirely backed by theoretical considerations. The process sometimes is dictated 
by personal preferences and tastes. It is possible that some readers may not be 
willing to except the requirement of monotonicity as formulated in Definition 
[1] as a crucial and necessary feature of an MIRT model. For those readers the 
main theorem is interpreted a bit differently. First, we note the close connection 
between the notion of compensatory model to monotonicity. Usual terminology 
is that the model is compensatory, if the probability of the correct response 
may be high even with the lack of ability in all but one dimension. That is, 
sufficiently high ability in one dimension is able to compensate for the lack of it 
in other dimensions. In fact, compensatory property follows from monotonicity 
as an easy application of Theorem [21 If compensatory property is understood 
in a sense that it is true in any coordinate system, then the reverse is also true, 
and the two notions are equivalent. With this in mind the theorem states that 
any compensatory MIRT model is a direct generalization of a univariate IRT 
model. 

In either way. Theorem [2] establishes a prominent role for the Scalar Product 
model as an MIRT model. 
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3.5. Estimation in MIRT 

Let us now restrict our attention to the Scalar Product model. A typical two 
dimensional {D — 2) student likelihood is given in Figure S] As in unidi- 
mensional IRT, the maximum place of this function plays a special role in the 
estimation of MIRT model parameters. A curious feature of this graph is that a 
pronounced unbalance can be observed between the standard errors of the two 
ability estimates. Here, standard error is understood as the inverse of the cur- 
vature of the graph at the maximum place. There is a well identified direction 
in which the standard error is minimal and in the direction orthogonal to this 
the standard error appears to be much bigger. One may even say that, despite 
our efforts, the model shows definite signs of unidimensionality. 

The reason behind this is very simple. A student likelihood is formed as a 
product of probabilities of the actual responses given by item response hyper- 
surfaces similar to the one shown on the RHS of Figure [TJ These hypersurfaces 
are always increasing towards the first quadrant (correct response) or towards 
the third quadrant (incorrect response). Hence, the product of these will be the 
above observed "ridge" of Figure 21 It is a ridge because the observed response 
is either correct or incorrect and no distinction is made between events of the 
students using only one of the dimensions correctly during the assessment. In 
other words, since there is no observed data for the different dimensions, the 
model will not be able to provide two distinct, meaningful estimates for the 
abilities of the person on the different cognitive dimensions. 




Fig 4. Scalar Product MIRT student likelihood. 
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3.6. Dimension- wise Independence 



The careful reader should have noticed, that concerning one particular point 
the presentation is not faithful to its own principles. That is, the notion of 
dimension-wise independence was used without any discussion of its invariance, 
or coordinate system independence. It is easy to see that the dimension-wise 
independent model does not satisfy the requirement of monotonicity, therefore 
wc would not consider it as a valid MIRT model. On the other hand, it might 
be useful to see explicitly how badly the the functional representation of the 
dimension-wise independent model behaves to appreciate the niceties of the 
Scalar Product model even more. 

Invariance of dimension-wise independence for the the model 

/.-.:M--[0,1], W) = nTT^3,W^ (21) 

would mean that the factorization property holds in any other coordinate sys- 
tems. 

Mathematically, this would require that for any invertible matrix G € GL{D) 
{G expresses change of coordinates) wc have a function 

/i*^ : M X M X M ^ M, {ai,bi,t) h°{ai,hi,t) 

and a pair of invertible matrices U,V & GL(D) so that when 6 = G ■ d' {0,6' Q 
M^) we have a factorization 

D 

/a";6W = n'^''((^°)'i'(^^)'^'^^) (22) 

that is 



fZb{G ■ 9') 

D 



TT i 



(23) 



with a'^ = {Ua)d and 5^ — {yb)d- The role of U and V is to ensure that the 
function h'^ is the same for all factors in the product by allowing this function 
to depend on different linear combinations of the elements of a and of b. 

To show that this is too much to ask for in general, let us first assume that a 
factorization f{x, y) = h{x)g{y) holds for some function / so that /i(0) ^ and 
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5(0) ^ 0. Then, 

h{x) 

9{y) 

and 



/(^,0) 
5(0) ' 

/(o,y) 
MO) ' 

1 f{x,y) 



h{0)g{Q) f{x,0)f{0,yy ^^^^ 
Now, for the sake of concreteness, let us take D ~ 2 and a = (ai, ai) e and 
b = (0, 0) e M^. Also, let us take G = ( ! ] ]■ With these, ^ becomes 



h{9[)g{e',) (25) 



X 1 -1 

1 1 



with some h, g : M. ^ M.. From the function 



1 _ i+e-°i"''l-''2) 

1 1 1 v^"i 



^(0)5(0) 



should be constant. This is clearly not the case, showing that the factorization 
((23|) does not hold in general. 

It seems that the definition of dimension- wise independence is not an absolute 
one. We have a choice of either dropping it altogether, or if need arises, we 
may change it. To formulate this notion we have to relax the monotonicity 
requirement of MIRT in Definition [T] by requiring the monotonicity of the /„ for 
all w e Ve , that is assumed w is zero in Definition [1] and in [T31 Let us call this 
type of models ray-wise monotonic MIRT models. 

Definition 2 A ray-wise monotonic MIRT model given by an IRHS is dimension- 
wise independent if there exists a coordinatization of abilities so that the func- 
tional representation of the model fa,b{()) can be written as a product of factors 

D 

/a,6(e) = n^(«'^'^'^'^^')- (27) 

d=l 

The specialty of this property comes from the fact that for a general IRHS it is 
very rare that the functional representation can be factored so that one may con- 
sider it dimension- wise independent. This interpretation was used throughout 
the paper, when the Whitley model was called dimension-wise independent. 



4. Conclusion 



A coordinate free definition of MIRT has been put forward in the paper. Our 
main argument is that in a coordinate free setup it is easier to tell apart genuine 
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MIRT objects from potential artifacts. These artifacts can be notions and rela- 
tionships that should not be considered integral parts of the model since their 
key features which may be apparent in one could vanish in another coordinate 
system. We showed that it is possible to provide a full classification of monotonic 
models solely based on general, coordinate-free considerations. 

It is important to note that the classification was carried out at the level of 
a single item IRHS, but it is in no way restricted to single item tests. IRT and 
MIRT models handle tests by invoking the local independence assumption and 
form the likelihood of the model by multiplying single item conditional probabil- 
ities together. The flavor of the test (Rasch, 2P1, normal ogive, compensatory, 
polytomous, etc.) is always given at the single item level. Our treatment is no 
exception. 

It is very important that the reader does not mistake the promotion of the 
coordinate free description for an argument for a completely coordinate free han- 
dling of the entirety of MIRT. In fact, it should be explicitly stated that without 
a choice of coordinates meaningful MIRT practice cannot exist. In addition to 
this, every discussion of MIRT features can be fully carried out using R-^ as the 
main model space for abilities. Should such a path be chosen, however, one has 
to be careful to meticulously maintain the coordinate system invariance of the 
theory every step of the way. The contribution of this paper is an introduction 
of a framework to ease this burden by keeping the presentation absolute (with- 
out choosing any coordinates) for as long as possible. The paper shows that 
one may be able to formulate general statements and reach valuable insights 
before switching to relative mode by an introduction of a particular basis. It is 
likely that someone may observe the relevance of a notion while in a particular 
coordinate system and may want to establish whether it is invariant by trying 
to create a definition in the absolute framework presented here. 

It is noteworthy, that the necessity of the existence of a coordinate free rep- 
resentation of our physical world led Einstein to formulate both the special and 
the general theories of relativity (0; 0|). The fundamental dogma in relativity 
theory is that the events of the physical world take place without being aware of 
any coordinate system. Therefore, any faithful description should be invariant 
of the change of coordinate system. Better yet, a description of the physical 
world is sought that bypasses the use of coordinates altogether. 

A reader interested in the successes of coordinate free description of the 
physical world may also find the books useful. 
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